Articulatory Synthesis of Speech and Singing: State of the Art and Suggestions for Future Research
نویسندگان
چکیده
Articulatory synthesis of speech and singing aims for modeling the production process of speech and singing as human-like or natural as possible. The state of the art is described for all modules of articulatory synthesis systems, i.e. vocal tract models, acoustic models, glottis models, noise source models, and control models generating articulator movements and phonatory control information. While a lot of knowledge is available for the production and for the high quality acoustic realization of static spoken and sung sounds it is suggested to improve the quality of control models especially for the generation of articulatory movements. Thus the main problem which should be addressed for improving articulatory synthesis over the next years is the development of high quality control concepts. It is suggested to use action based control concepts and to gather control knowledge by imitating natural speech acquisition and singing acquisition scenarios. It is emphasized that teacherlearner interaction and production, perception, and comprehension of auditory as well as of visual and somatosensory information (multimodal information) should be included in the acquisition (i.e. training or learning) procedures.
منابع مشابه
An Articulatory-Based Singing Voice Synthesis Using Tongue and Lips Imaging
Ultrasound imaging of the tongue and videos of lips movements can be used to investigate specific articulation in speech or singing voice. In this study, tongue and lips image sequences recorded during singing performance are used to predict vocal tract properties via Line Spectral Frequencies (LSF). We focused our work on traditional Corsican singing “Cantu in paghjella”. A multimodal Deep Aut...
متن کاملArticulatory synthesis of singing
A system for the synthesis of singing on the basis of an articulatory speech synthesizer is presented. To enable the synthesis of singing, the speech synthesizer was extended in many respects. Most importantly, a rule-based transformation of a musical score into a gestural score for articulatory gestures was developed. Furthermore, a pitch-dependent articulation of vowels was implemented. The r...
متن کاملEngineering of Membrane Gas Separation Processes: State of The Art and Prospects
Membrane processes are today one of the key technologies for industrial gas separations and show growing interest for future use in sustainable production systems. Besides materials development, dedicated engineering methods are of major importance for the rigorous and most efficient design of membrane units and systems. Starting from approaches based on simplified hypotheses developed in the 5...
متن کاملVirtual Talking Heads and audiovisual articulatory synthesis
Our approach to audiovisual articulatory synthesis involves the development of Virtual Talking Heads that integrate the articulatory, aerodynamic and acoustic phenomena underlying speech production. Specifically, these Talking Heads are faithful clones of the speakers whose data the various models are based on. Our contribution presents some of the results achieved at ICP in this domain: 3D oro...
متن کاملInterdisciplinary Approaches for Advancing Articulatory Speech Theory and Synthesis
Articulatory synthesis research has long been dominated by frequency domain and concatenate samplebased speech synthesis techniques. While successful in some domains (e.g., voice-based databases), these techniques still cannot produce natural looking and sounding speech from text for an arbitrary speaker. Natural looking and sounding speech technology is one of the next major milestones in voic...
متن کامل